Practical Algorithmic Techniques for Several String Processing Problems
Authors
Abstract
The domains of data mining and knowledge discovery make use of large amounts of textual data, which need to be handled efficiently. Specific problems, like finding the maximum weight ordered common subset of a set of ordered sets or searching for specific patterns within texts, occur frequently in this context. In this paper we present several novel and practical algorithmic techniques for processing textual data (strings) in order to efficiently solve multiple problems. Our techniques make use of efficient string algorithms and data structures, like KMP, suffix arrays, tries and deterministic finite automata.
Keywords: string processing; prefix query; trie; suffix array; KMP; deterministic finite automaton
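As an illustration of one building block named in the abstract, the following is a minimal sketch of textbook KMP pattern search in Python. It is not taken from the paper; the function names `prefix_function` and `kmp_search` are hypothetical and chosen here only for illustration.

```python
def prefix_function(pattern: str) -> list[int]:
    """pi[i] = length of the longest proper prefix of pattern[:i+1]
    that is also a suffix of pattern[:i+1]."""
    pi = [0] * len(pattern)
    k = 0  # length of the currently matched border
    for i in range(1, len(pattern)):
        # Fall back to shorter borders until the next character fits.
        while k > 0 and pattern[i] != pattern[k]:
            k = pi[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        pi[i] = k
    return pi


def kmp_search(text: str, pattern: str) -> list[int]:
    """Return the start index of every (possibly overlapping)
    occurrence of pattern in text, in O(len(text) + len(pattern))."""
    if not pattern:
        return []  # assumption: an empty pattern yields no matches
    pi = prefix_function(pattern)
    matches = []
    k = 0  # number of pattern characters currently matched
    for i, ch in enumerate(text):
        while k > 0 and ch != pattern[k]:
            k = pi[k - 1]
        if ch == pattern[k]:
            k += 1
        if k == len(pattern):
            matches.append(i - k + 1)
            k = pi[k - 1]  # keep going to find overlapping matches
    return matches
```

For example, `kmp_search("abababa", "aba")` reports the overlapping occurrences at positions 0, 2 and 4. The text is scanned once and never re-read, which is what makes this kind of search practical on large text collections.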
Similar resources
Algorithmic Techniques for Several Optimization Problems Regarding Distributed Systems with Tree Topologies
As the development of distributed systems progresses, more and more challenges arise and the need for developing optimized systems and for optimizing existing systems from multiple perspectives becomes more stringent. In this paper I present novel algorithmic techniques for solving several optimization problems regarding distributed systems with tree topologies. I address topics like: reliabili...
Algorithmic Solutions for Several Offline Constrained Resource Processing and Data Transfer Multicriteria Optimization Problems
In this paper we present novel algorithmic solutions for several resource processing and data transfer multicriteria optimization problems. The results of most of the presented techniques are strategies which solve the considered problems (almost) optimally. Thus, the developed algorithms construct intelligent strategies which can be implemented by agents in specific situations. All the describ...
The Monge array: an abstraction and its applications
This thesis develops a body of versatile algorithmic techniques. We demonstrate the power and generality of these techniques by applying them to a wide variety of problems. These problems are drawn from such diverse areas of study as computational geometry, VLSI theory, operations research, and molecular biology. The algorithmic techniques described in this thesis are centered around a family o...
Algorithmic Techniques for Processing Data Streams
We give a survey of some algorithmic techniques for processing data streams. After covering the basic methods of sampling and sketching, we present more evolved procedures that rely on those basic ones. In particular, we examine algorithmic schemes for similarity mining, the concept of group testing, and techniques for clustering and summarizing data streams. 1998 ACM Subject Classification F...
Real Time Implementation of a License Plate Location Recognition System Based on Adaptive Morphology
License plate recognition (LPR) using morphology has the advantages of resistance to brightness changes, high-speed processing, and low complexity. However, these approaches are sensitive to the distance of the plate from the camera and to the imaging angle. Various assumptions reported in other works might be unrealistic and cause major problems in practice. In this paper we considered ...
Journal:
- CoRR
Volume: abs/0912.0807, Issue:
Pages: -
Publication date: 2009